Method and computer program for digital image processing
for two-dimensional electrophoresis.

DESCRIPTION

FIELD OF THE INVENTION 

The present invention relates to methods and devices for digital image processing of
biological sample separations. More specifically it relates to techniques for processing
digital image data for a two-dimensional array of sample substance spots and marker
substance spots in an electrophoresis gel by image normalisation.

BACKGROUND OF THE INVENTION

In a cell obtainable from e.g. a cell culture or tissue sample, an existing pool of
proteins, a proteome, exists as a part of biological processes and functions. In many
different clinical, diagnostic and analytical situations it is desirable to gain knowledge
about the identity and amount of a protein or a group of proteins at a certain point in
time, and also of certain changes over time. Certain biological processes are defined
by changes in morphology and physiology due to changes in the expression, i.e. the
protein level, of particular genes. Also, developmental stages of cells can be defined
and monitored by their global pattern of gene expression and the progressive changes
that occur over time of particular genes or groups of proteins. Even more, as a
response to treatment of cells with chemical factors, e.g. drugs, hormones, nutrient
factors, environmental factors and other growth condition factors, specific proteins or
groups of proteins can change their expression and as such allow for monitoring or
identifying a response or treatment. Information of this type can be used to e.g.
detect, identify and classify tumours in terms of malignancy, evaluation of therapies,
adjustment of therapies as well as diagnosis and prognosis. Also, changes in proteins
due to mutations, cleavage, phosphorylation or glycosylation can be obtained. Further
identification of unknown proteins can e.g. be done by mass spectrometry of
unidentified sample protein spots on the gel.

A general method used for detecting, identifying, monitoring and quantifying the
compositions of complex biological mixtures is two-dimensional gel electrophoresis. In
such a method the separation performed in two dimensions enables detection and
identification of a large number of components that would not be separable and
distinguishable in a one-dimensional separation, such as a linear separation. A
frequently used method for monitoring the protein expression by 2D (two-dimensional)
gel electrophoresis comprises the feature of analysing a set of multiple gels in one run.

A typical prior art method is detection and identification of sample spots in a 2D
polyacrylamide gel. By using large gels, the number of proteins to be detected and
identified can be several hundred up to about 10 000 in one gel. Each spot is detected
by means of a signal derived from the individual spot. In a typical gel size of 24x18
cm, several thousands of sample protein spots are detected. Normally, about 10% of
the total number of sample spots need to be selected as so-called landmarks. After a
separation process has been performed, the spatial relation between these landmarks,
or markers, and the sample spots is measured and used for the purpose of
identification of the sample substances, in a manner well known to the skilled person.
This further involves a few hundred manually selected landmarks in each gel. The
process of detection, identification and analysis of the separate components in a
two-dimensional electrophoresis gel is a complex and difficult task, and often involves
tedious and time-consuming manual steps demanding months of experience to
perform. Several attempts have been made to develop means and methods to
simplify, standardise and automate the procedure, as described in Smilansky et al.,
2001, Electrophoresis 22:1616-1626; Thompson et al., 1998, in 20th Annual
International Conference of the IEEE Engineering in Medicine and Biology Society, Vol.
20:1060-1063; Wanatabe et al., 1998, in 20th Annual International Conference of the
IEEE Engineering in Medicine and Biology Society, Vol. 20:804-808.

The manner in which the sample protein spots are distributed across the
two-dimensional electrophoresis gel depends on the separation parameters used in the
first and second dimension in the two-dimensional electrophoresis procedure. Two
parameters commonly used are isoelectric point and molecular size (Gorg et al., 2000,
Electrophoresis 21:1037-1053). Furthermore, various ways of obtaining detectable
signals from the spots have been used, such as post-gel colouring techniques, e.g.
silver staining as described in Sinha et al., 2001, Proteomics 1:835-840; Shevchenko
et al., 1996, Anal. Chem. 68:850-858. Other techniques that are used include optical
density, radioactive emission, fluorescence emission, and colorimetric signals (Pavon
et al., 1999, J Interferon Cytokine Res. 19:589-599; Steinberg et al., 2001,
Proteomics 1:841-855; Herich et al., 2001, Biotechniques 31:146-149; Kemper et
al., 2001, Electrophoresis 22:970-976; Lauber et al., 2001, Electrophoresis 22:919-932;
Berggren et al., 2000, Electrophoresis 21:2509-2521; Steinberg et al., 2000,
Electrophoresis 21:486-496). Detection of the spots is commonly achieved by the use
of imaging devices that convert the detected signals into digital data and store the
data as information on computer storage media.

One of the key features in identifying the sample spots is the assignment and use of
reference spots known as "landmarks". The landmarks are actual protein spots that
are manually selected by the user, i.e. manual landmarking. Computer software is
further used for processing the manually selected spots in order to automatically
detect and identify the same spots in all of the member gels in one matchset.
These user-selected landmarks are preferably relatively few in number compared to
the total number of sample spots in an individual electropherogram. The selection
criteria of the manual landmarks are that the spots should be well resolved, that they
are well isolated from other spots, and that they appear in all the gels of the matchset.
The number of spots to be assigned as landmarks must be large enough that all of the
remaining protein spots among the various gels will be successfully matched by the
automatic processing. The function of the landmarks is to serve as guideposts in the
gel-to-gel comparison, thereby aiming at reducing and compensating for differences and
distortions among the member gels in the matchset to assure that there will be a
proper correspondence of protein spots among different gels in the matchset.

Further, by using known sample proteins manually chosen as landmarks in the 2-DE,
one has to consider only proteins that exist in all gels in one matchset. Such a protein
must exist in a relatively large amount to be readily repeatable and detectable on all of
the gels. Due to this, the spots chosen as protein landmarks are often large and
blob-like. This gives less defined landmarks that are difficult to position accurately, i.e. to
find the exact centre of the landmark, which is a source of inaccuracy in the analysis
process of the sample protein spots.

The process of selecting and marking spots to be used as landmarks is slow and
tedious, and is one of the limiting factors in two-dimensional gel electrophoresis
analysis. Normally, since about 10% of the total number of protein spots needs to be
selected as protein landmarks to make the image analysis algorithm work correctly,
several hours must be used for each run to pick said landmarks manually. Also, in
addition to assigning the above described selection criteria for manually chosen
landmarks, the time involved in making the selections and marking the spots adds to
the cost and time involved in performing the analysis. Even further, the level of user
involvement raises questions regarding reproducibility and reliability.

Being a generally known and widely used technique, the state of the art comprises
several disclosures relating to two-dimensional electrophoresis.

US 5 139 630 teaches a method for detecting and identifying protein species in a
sample by capillary zone electrophoresis by the addition of at least two external
markers, one being an ionic species and one being a neutral charged species. This method
is only applicable for capillary electrophoresis, i.e. where the separation is in one
dimension, here disclosed for charge densities, and not applicable for a separation using
more than one dimension as in e.g. a two-dimensional gel electrophoresis.

WO 01/07920 discloses a method for automated landmarking for two-dimensional gel
electrophoresis by the addition of marker proteins to the sample proteins. However, the use
of proteins as external landmarks has several disadvantages, such as high production costs
and short shelf life of the final protein product. Further, the use of protein landmarks
means that special measures have to be taken to make such proteins visible in contrast to
proteins present in a sample material.

US 5,073,963 teaches an interactive computerized method for matching visual patterns of
polypeptide spots in two-dimensional (2-D) gel electrophoresis. The computerized method
manipulates spot pixel coordinates using staged coordinate transformation techniques on
spot markers and unknown steady spots to reduce gel preparation distortions and allows a
user to produce matching results in a manner that compares the transformed spot data using
either a single reference gel or multiple reference gels approach for producing the
matching results. Dominant sample spots are selected as marker spots, on which
transformation is based.

In the light of the aforementioned problems it is thus an object of the invention to provide
means and methods for reducing time and cost relating to the process of electrophoresis.
An aspect of this object is to increase the reproducibility and reliability of two-dimensional
gel electrophoresis used for separation, detection, identification and quantification of
proteins, and to provide a solution overcoming the problems associated with the prior art
means and methods.

SUMMARY OF THE INVENTION 

In order to fulfil the objects above, the present invention provides a method for using
detected signals from externally added markers, having known properties, in order to
transform a detected 2D gel image to a form in which it is comparable to other images of
2D gels. In particular, the present invention makes use of non-protein markers, more
specifically markers selected from dendrimers.






The method according to the present invention is defined in the appended claims. 

Needless to say, this method can be used to compare plural gels, two by two. Since the
markers are selected and have known properties, this transformation and subsequent
comparison can be performed completely automatically for two or more gels, saving hours of
manual work which as such constitutes a potential source of errors.



BRIEF DESCRIPTION OF THE DRAWINGS

The features of the present invention will be described in more detail below by means of 
preferred embodiments and examples, with reference being made to the appended 
drawings, in which 

Fig. 1 schematically illustrates a grid of co-ordinates relating to ideal positions of landmark
spots in an electrophoresis gel for given electrophoresis conditions, according to an
embodiment of the invention;
Fig. 2 schematically illustrates a grid of co-ordinates relating to the detected positions of
landmark spots in an electrophoresis gel for said given electrophoresis conditions,
according to an embodiment of the invention;
Fig. 3 schematically illustrates an image comprising the sample spots of the same
electrophoresis gel, according to an embodiment of the invention;
Fig. 4 schematically illustrates a normalised image of the sample image in Fig. 3,
according to an embodiment of the invention;
Fig. 5 schematically illustrates how a dendrimer may be built up according to an
embodiment of the invention;
Fig. 6 shows formulas from which a dendrimer core may be selected in embodiments of
the invention;
Fig. 7 illustrates an example of a diaminobenzoic acid as a core in an embodiment of the
invention; and
Fig. 8 illustrates examples of monomers with monovalent and divalent branching units,
according to embodiments of the invention.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

As revealed above, the present invention relates to the method of using external
markers in a gel electrophoresis process for normalising images from 2-D gels, making
it possible to match sample spots between different images from 2-D gels, and to
quantify sample spots in images from 2-D gels, in a more effective way than with the
prior art techniques.

Once an electrophoresis separation process has been performed on a gel comprising
sample substances and landmark substances, these substances will be gathered in
spots dependent on their properties, wherein each spot, or the centre thereof,
corresponds to a certain parameter value pair in the two dimensions. The resulting
pattern of spots will hence form a 2D array of more or less discrete spots. A suitable
technique for capturing an image of this pattern is subsequently used, such that
signals from the spots in the array are detected. Preferably the image is captured by
scanning, but it may also be captured by a single shot. The position of the landmark
spots is then determined from the captured image. By using a digital scanner, the
capturing of the image and the determination of the landmark spot positions may be
performed in essentially the same step. From a single shot image, the spot positions
may be determined by subsequent scanning of that image.

In order to detect the landmark spots, i.e. the positions of the landmark spots in the
captured image, it is advantageous if the landmark substances are easily
distinguishable from the sample substances. As described below this can be achieved
in various ways. A suitable algorithm is preferably used to determine the co-ordinates
for the centre of the spots for preferably all landmark spots in the gel. A data set of
pairs of co-ordinates is thus formed, a detected landmark data set, representing
signals in a detected image in an actual electrophoresis case. Each co-ordinate pair of
the detected landmark image refers to a certain parameter value pair in the two
dimensions of the 2D electrophoresis. One example of a landmark image, wherein the
detected positions for the landmark spots are indicated, is shown in Fig. 2.
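
The text leaves the choice of centre-finding algorithm open. As a minimal sketch only, and
assuming (this is not stated in the source) that each landmark spot has already been cut out
into its own small window of grey-scale values, an intensity-weighted centroid could be
computed as follows. The function name `spot_centroid` and the `threshold` parameter are
hypothetical.

```python
def spot_centroid(image, threshold):
    """Return the intensity-weighted centre (x, y) of one spot.

    `image` is a list of rows of grey-scale values covering a window that
    is assumed to contain a single spot. Pixels at or below `threshold`
    are treated as background. Returns None if no pixel is above it.
    """
    total = 0.0
    sum_x = 0.0
    sum_y = 0.0
    for y, row in enumerate(image):
        for x, value in enumerate(row):
            if value > threshold:
                total += value
                sum_x += x * value
                sum_y += y * value
    if total == 0:
        return None
    return (sum_x / total, sum_y / total)
```

Applying such a function to every landmark window would yield the list of co-ordinate
pairs that the text calls the detected landmark data set.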

Furthermore, a sample image is formed by capturing an image of the gel in which the
sample spots are detected. As a person skilled in the art will realise, the landmark
image and the sample image may be derived or formed from the same captured
image, or from successive captured images. The sample image is digitally represented
by a sample image data set comprising a signal value for each pixel in the captured
sample image, or global image. This global image may or may not include the
landmarks. Preferably, said signal value is expressed as a level of magnitude in a
scale, such as a grey scale, relating to the exposure of the image in that particular
pixel. An example of a sample image of a separated electrophoresis gel is shown in
Fig. 3, corresponding to the landmark image of Fig. 2.

For different reasons, such as inconsistencies in the gel, the landmark substances in
an actual electrophoresis run will rarely be located as would be expected from their
known properties and the chosen electrophoresis conditions. This is one of the reasons
why simple direct comparison between two electrophoresis cases is not always
reliable, since this uncertainty naturally is equally valid for the sample substances.

According to the invention, this problem is overcome by transforming the global image
such that the external landmarks are positioned in the co-ordinates to which their
properties in the separation processes correspond. This way, two or more
electrophoresis gels run under the same conditions may be independently transformed
into a reference format, such that they may be compared to each other. The
transformation feature of the invention is based on usage of known landmarks. One
feature of using landmarks selected from substances of known properties is that when
the conditions of the separation process of the electrophoresis are known, also the
expected positions which these marker substances would ideally assume in the gel are
known, or can be derived. By said conditions is here meant for instance the type, e.g.
isoelectric point and molecular size, and range of parameters used to separate the
samples in the gel, and the type of gel used. Consequently, once a set of landmark
substances is chosen and certain electrophoresis conditions are selected, a pattern of
discrete co-ordinates corresponding to the positions those landmark substances would
assume, based upon their properties, can be derived. Accordingly, a data set of pairs
of reference co-ordinates is thus formed, here denoted a reference landmark data set,
representing signals in a reference image or ideal image relating to an ideal
electrophoresis case. Each co-ordinate pair of the reference image refers to a certain
parameter value pair in the two dimensions of the 2D electrophoresis. An example of
an ideal marker image, corresponding to the landmark image derived from the
detected landmarks as shown in Fig. 2, is illustrated in Fig. 1. By comparison between
these two figures it will be obvious that the positions of the corresponding markers do
not coincide perfectly.
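
How the reference co-ordinates are derived from the marker properties depends on the gel
and the separation conditions, which the text does not fix. Purely as an illustrative
sketch, the following assumes a linear pI axis in the first dimension and a molecular-size
axis that is linear in log10(Mw) in the second, with the largest molecules at the top.
Both assumptions, the function name `reference_position` and all parameters are
hypothetical and not taken from the source.

```python
import math


def reference_position(pI, mw, pI_range, mw_range, width, height):
    """Map a marker's (pI, Mw) to ideal (x, y) pixel co-ordinates.

    Assumption: x is linear in pI over `pI_range`, and y is linear in
    log10(Mw) over `mw_range`, with the largest Mw at y = 0.
    """
    pI_min, pI_max = pI_range
    mw_min, mw_max = mw_range
    x = (pI - pI_min) / (pI_max - pI_min) * (width - 1)
    log_span = math.log10(mw_max) - math.log10(mw_min)
    y = (math.log10(mw_max) - math.log10(mw)) / log_span * (height - 1)
    return (x, y)


def reference_landmark_set(markers, pI_range, mw_range, width, height):
    """Build the list of ideal co-ordinate pairs for a list of (pI, Mw)."""
    return [reference_position(pI, mw, pI_range, mw_range, width, height)
            for pI, mw in markers]
```

A real gel system would replace the two axis models with calibration data for the
specific strip and gel type in use.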

In order to be able to perform the aforementioned transformation, a mathematical
relation between the reference landmark data set and the detected landmark data set
is determined, such that the co-ordinates of the landmarks, or markers, in the two
images represented by said data sets, i.e. the reference image and the detected
landmark image, are mapped onto each other. The mathematical relation for this
mapping is calculated from the position differences between corresponding co-ordinate
pairs in the respective sets, but at the same time the mathematical relation is continuous for
the entire reference image.
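
The specific relation used by the invention is not given in this passage. As a hedged
illustration of one simple continuous relation, the sketch below fits an affine map from
reference to detected co-ordinates by least squares. An affine map is an assumption chosen
for brevity: it captures shift, scale, rotation and shear but not local gel distortions,
for which a smoother, locally varying model would be needed. The names `solve3`,
`fit_affine` and `apply_affine` are hypothetical.

```python
def solve3(m, v):
    """Solve the 3x3 linear system m * s = v by Gauss-Jordan elimination."""
    a = [row[:] + [rhs] for row, rhs in zip(m, v)]
    for col in range(3):
        # Partial pivoting: pick the row with the largest entry in this column.
        pivot = max(range(col, 3), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-12:
            raise ValueError("landmarks are collinear or too few")
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(3):
            if r != col:
                factor = a[r][col] / a[col][col]
                a[r] = [ar - factor * ac for ar, ac in zip(a[r], a[col])]
    return [a[i][3] / a[i][i] for i in range(3)]


def fit_affine(reference, detected):
    """Least-squares affine map taking reference (x, y) to detected (x, y).

    Returns two coefficient triples [a, b, c] such that
    detected_x ~ a*x + b*y + c, and likewise for detected_y.
    """
    rows = [(x, y, 1.0) for x, y in reference]
    normal = [[sum(r[i] * r[j] for r in rows) for j in range(3)]
              for i in range(3)]
    coeffs = []
    for k in range(2):  # k = 0 fits detected x, k = 1 fits detected y
        rhs = [sum(r[i] * d[k] for r, d in zip(rows, detected))
               for i in range(3)]
        coeffs.append(solve3(normal, rhs))
    return coeffs


def apply_affine(coeffs, point):
    """Map one reference point into the detected image plane."""
    x, y = point
    return tuple(c[0] * x + c[1] * y + c[2] for c in coeffs)
```

At least three non-collinear landmark pairs are needed for the fit. With more pairs the
residuals between fitted and detected positions give a rough indication of how much
non-affine distortion remains.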



Each pixel of the transformed image is then created by collecting the value for the
corresponding pixel in the detected global image according to the following basic
procedure. A first pixel in the image plane of the reference image is mapped by said
mathematical relation into the image plane of the detected global image. If the global image
is distorted compared to what would have been expected under the present conditions, the
co-ordinates of the mapped pixel will not be the same as the original pixel in the reference
image plane. Once the mapped pixel co-ordinates have been calculated, the signal value of
that co-ordinate is read in the sample image data set, and that signal value is thereafter
assigned to said first pixel in the reference image plane. Each part of the reference image
plane is then built up pixel by pixel to form a complete transformed image of the detected
global image. Consequently, the entire detected image of samples is transformed by this
mapping. The detected image is then globally transformed. An example of such a
globally transformed, normalised, image is illustrated in Fig. 4. In fact, the image in Fig. 4
is transformed from the detected sample image of Fig. 3, using a mathematical relation
derived from the relation between the ideal and detected landmark images of Figs 1 and 2,
respectively, in accordance with an embodiment of the invention. All electrophoresis test
items separated by this specific type of gel and strip, and having the corresponding set of
landmarks, will in this globally transformed format have their landmarks placed on exactly
the same positions or co-ordinates. A specific example of determining a mathematical
relation for mapping the landmark images onto each other is described further below.
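
The pixel-by-pixel procedure described above can be sketched as follows. This is a minimal
illustration, not the implementation of the invention: `mapping` stands for any callable
that takes a reference pixel (x, y) to co-ordinates in the detected image, and the use of
nearest-neighbour sampling and of a `fill` value for pixels mapped outside the detected
image are assumptions added here.

```python
def normalise_image(sample, mapping, width, height, fill=0):
    """Build the transformed image in the reference image plane.

    For each reference pixel, `mapping` gives the corresponding position
    in the detected sample image; the signal value found there (nearest
    pixel) is assigned to the reference pixel.
    """
    src_h = len(sample)
    src_w = len(sample[0]) if src_h else 0
    out = []
    for y in range(height):
        row = []
        for x in range(width):
            sx, sy = mapping((x, y))
            ix, iy = int(round(sx)), int(round(sy))
            if 0 <= ix < src_w and 0 <= iy < src_h:
                row.append(sample[iy][ix])
            else:
                row.append(fill)  # mapped outside the detected image
        out.append(row)
    return out
```

Because the loop runs over the reference plane and looks values up in the detected
image, every output pixel receives exactly one value and no holes appear in the result.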

In a preferred embodiment of the invention, external landmarks are used. Preferably these
external landmarks are artificial molecules which are not proteins, as disclosed in the
related patent application SE 0103103-8, which is the priority document of
WO 03/025581. Several technical effects are obtained by the invention. The analysis of the
detected image is automated to a higher degree than in the prior art solutions. By adding
external knowledge, i.e. substances of known properties, the time-consuming manual
placing of landmarks in the image in order to obtain a match with another image, which is
also a potential source of errors, is eliminated. Instead, each image is independently
processed into a format in which it is comparable with any other processed image carrying
the same landmarks.
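
Once two gels have been brought into the same reference format, corresponding sample spots
should lie at nearly the same co-ordinates. The source does not specify a matching
algorithm; the following is only a sketch of one simple possibility, a greedy
nearest-neighbour pairing within a distance tolerance. The function name `match_spots`
and the `tolerance` parameter are hypothetical.

```python
import math


def match_spots(spots_a, spots_b, tolerance):
    """Pair spots from two normalised gels by proximity.

    Each spot in `spots_a` is paired with the nearest not-yet-used spot in
    `spots_b` lying within `tolerance` pixels. Returns (index_a, index_b)
    pairs; spots without a partner are left unmatched.
    """
    matches = []
    used = set()
    for i, (ax, ay) in enumerate(spots_a):
        best, best_d = None, tolerance
        for j, (bx, by) in enumerate(spots_b):
            if j in used:
                continue
            d = math.hypot(ax - bx, ay - by)
            if d <= best_d:
                best, best_d = j, d
        if best is not None:
            matches.append((i, best))
            used.add(best)
    return matches
```

Spots left unmatched in either gel are candidates for proteins present in only one of the
samples, although in practice they would need checking against detection limits.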

Using selected artificial landmarks further brings about the effect that landmark
substances can be selected such that they form a desired pattern over the image, e.g.
with substantially equal distance between neighbouring landmark spots, resulting
in a more even accuracy over the entire image and a more accurate transformation.
The proteins functioning as landmarks in the prior art are spatially rather undefined
and differ from time to time, which results in an uncertainty in the positioning of
the landmarks from a spot. Detection of the artificial landmarks is more secure since
the same amount of a certain landmark substance is always selected, wherein the spot
will essentially always look the same. Therefore, the algorithm used will determine the
co-ordinates in essentially the same way each time, rendering a more accurate
positioning.

According to an embodiment of the invention, the marker compounds suitable for gel
electrophoresis are added externally, and are not chosen from the samples, as
revealed above. The marker compound may be a polymer or it may be selected from
the group consisting of glycoconjugates, carbopeptoids, polynucleotides,
proteoglycans, fullerenes, carbohydrates and mixtures thereof.

In specific embodiments, the marker compound is characterised by a pI of about 2-12.
In still further embodiments, it is characterised by a pI of about 3-10.

Further embodiments include marker compounds characterised by a Mw of about 5-10^6
Da. In still even further embodiments, the Mw may be of about 10^3-10^5 Da, 5-1000
Da, 5-600 Da, 5-250 Da or an otherwise suitable interval of molecular size depending on
the size of the samples to be separated and detected. Specific embodiments comprise
marker compounds characterised by a pI of about 1-12 and by a Mw of about 100-10^6 Da.

In one embodiment the marker compound is a dendrimer. Dendrimers are built up
from different structural building blocks, some of which may include branching points to
achieve the treelike structure characteristic of dendrimers. The different building blocks
are coupled together to achieve said treelike structure.

The dendrimers offer two important features when used as marker molecules, namely
a) the size can be easily modified by adding more layers and b) the properties of the
dendrimeric compounds can be modified by adding different functional groups to the
layers. Since the individual building blocks may be composed of repeatable units, the
dendrimer molecule may be built up in only a few coupling steps, which
is economically and technically advantageous.

According to the invention, the dendrimer may comprise at least one monomer, at least one
functional group and optionally at least one core as building blocks. The dendrimer may be
built up from separate building blocks as shown in figure 5, which is identical with figure 2
in the patent application SE 0103103-8 referred to above. Said dendrimers may be built up
either by a divergent strategy or by a convergent strategy as known in the art of dendrimer
synthesis.

In one embodiment, the dendrimer to be used as external landmarks according to the 
invention is synthesised according to the divergent strategy. 


The dendrimer may be represented by the general formula

(core)_n (monomer_{1-o})_x (functional group_{1-p})

wherein n is an integer from 0-5 representing the number of different co-existing optional
cores,
wherein o is an integer from 2-1000 representing the number of different monomer building
blocks within the monomer distributed over x layers,
wherein x is an integer from 1-20 representing the number of layers, and
wherein p is an integer from 1-20 representing the number of different functional groups
within one functional group building block.

Other suitable dendrimers to be used as landmarks according to the invention are
commercially available dendrimers. However, such dendrimers lack at least one functional
group according to the invention. Such a functional group may then, of course, be coupled
onto the commercially available dendrimers using techniques known to the skilled man in
the art. Examples of, but not limited to, commercially available dendrimers that may be
used according to the invention are Astramol™ (DSM Agro, The Netherlands) and
Starburst® (Aldrich).

In specific embodiments of the invention, synthetic amino acid dendrimers will be used. In
still a specific embodiment, a diamine or a triamine is used as a core, diaminobenzoic acid as
a monomer and aspartic acid as a functional group. By introducing different numbers of
diaminobenzoic acid as a monomer a huge range of molecular masses may be achieved as
exemplified in the table below.

Molecular size of dendrimers made of a diamine as core building block, diaminobenzoic
acid as a monomer (mon) and aspartic acid as a functional group.

mon     Molecular size
0       900
1       788
2       1785
3       3779
4       7767
5       15743
6       31694
7       63597

The dendrimer may comprise at least one core. According to specific embodiments, the
core may include divalent, trivalent, tetravalent, multivalent cores and mixtures thereof.
According to the invention, the at least one core may be selected from the group consisting
of the formulas in figure 6, which is identical to figure 3 of SE 0103103-8, and mixtures
thereof.

In one embodiment, the core may be a diamine where n=2 or a triamine where n=1.

The core may contain further branching units, allowing the treelike structure characteristic
of the dendrimer to form.
An example of a dendrimer uses diaminoethane as the core.



The dendrimer also comprises at least one monomer. The monomer may add further
branching possibilities to the dendrimer molecule to achieve the treelike structure
characteristic of the dendrimer. Depending on the number of branching possibilities, the
monomer may be monovalent, i.e. elongating without branching, divalent, trivalent,
tetravalent, multivalent or mixtures thereof. Examples of such monomers with monovalent
and divalent branching units are shown in figure 8, which is identical to figure 5 of SE
0103103-8.



Specific embodiments use amide bonds between the different monomers.

In still a further embodiment, the at least one monomer may be 3,5-diaminobenzoic acid,
as shown in figure 7, which is identical to figure 4 of SE 0103103-8.
The number of monomers will contribute to the final molecular size of the dendrimer.
According to specific embodiments, using 3,5-diaminobenzoic acid as monomer, the
monomer may be in a number of 1-10.

The dendrimer may also contain a functional group according to the invention. The at least
one functional group will add known characteristics to the dendrimer. An example of
known characteristics is, e.g. a desired net charge to the molecule.

In specific embodiments, at least one functional group may be added to the free ends
generated in the coupling step. Due to the addition of the at least one functional group, the
landmark will according to the invention be able to position in the gel if the gel parameters
used for separation are selected so as to enable separation according to the marker compound
characteristics.

According to different embodiments the at least one functional group may be selected from
the group of zwitterions, anionic or cationic, oligopeptides, alcohols, thiols, carboxylic acids,
amines, fluorochromes, such as fluorescamine, isotopes and mixtures thereof. In a specific
embodiment, the fluorescamine used is the commercially available Fluram® available
from Molecular Probes, U.K. In still further specific embodiments, the at least one
functional group is selected from the group consisting of amino acids such as aspartic
acid and glutamic acid.

According to the invention, the marker compound has known characteristics affecting its
migration in a gel during gel electrophoresis so as to position the marker compound in said
gel. Such characteristics may, of course, be included by the addition of a functional group
to the dendrimer. Even further, it may reflect the molecular size of the marker compound.

By selecting landmarks with chosen characteristics, such as e.g. pI and molecular size
values, the externally added landmarks may position in a desired way over the gel. Such
information about characteristics of the external markers may be used for further analysis
of unknown samples, such as unknown proteins.



WO 03/058230 PCT/SE03/00029

According to the invention, a set of marker substances may form at least two marker
spots in a gel after an electrophoresis run. As such, these spots will appear in different
areas in the gel, due to the known characteristics of the marker compounds that form
said spots, separated in at least one dimension. The number of marker spots may
differ. This is due to the large number of different types of gels to be used in the gel
electrophoresis step. Another factor that may affect the number of markers needed is
the number of sample spots on the gel, which is obvious for the skilled man in the art
of gel electrophoresis and separation of samples such as e.g. protein samples. Another
factor that affects the number of spots needed is the number of separation
dimensions. The invention is applicable to electrophoresis separation in at least one
dimension and preferably two, but may be applied in separation in more than two
dimensions. The number of marker spots needed may therefore be, for example, about
0-50% of the number of sample spots, preferably 1-20%, e.g. 1-10% of the number of
sample spots. Specific embodiments of the invention may use marker spots, wherein
the at least two marker spots may be from about 2-1000 marker spots per said gel. In
still further embodiments, the at least two marker spots may be from about 2-500,
10-250, 10-100, or about 20-40 marker spots per gel. In an embodiment of a
two-dimensional gel, e.g. a polyacrylamide gel, in a size of about 24x18 cm, the number of
marker spots needed may be about 9-40, e.g. 9-16 marker spots.

The grid formed by the distributed marker spots in the gel may be spatially even, but
the marker set may also be selected such that the marker spots will appear unevenly
distributed for various reasons. One such reason may be that it is known or suspected
beforehand that certain sample substances will be present in the gel and will gather
in a specific region of the gel. It may then be desirable to select markers which will
likewise gather in the vicinity of this region, so as to have a sufficient number of
landmark readings there and thereby improve the sample quantification accuracy.

The marker spots appearing in a gel may contain different numbers of marker compounds
per marker spot. However, the amount of marker compound should be enough to allow a
clear and distinct detection of the marker spot on the gel. This is, of course,
dependent on how the sample spots are detected, and on the equipment and/or gel
resolution.

According to specific embodiments of the invention, at least two marker compounds
are used, forming at least two marker spots which may position in the gel at ideal
positions. As used herein, the word ideal is intended to mean ideal according to
empirical information, such as experimental data, or according to theoretical
information, such as chemical and/or physical data characterising the marker compounds
used. Such compounds may be designed to be dependent upon e.g. pI and molecular size
for their positioning. Of course, the marker compounds may be designed in different
ways, as is obvious to the skilled man in the art, to depend upon other characteristics
for separation according to dimensions other than pI and molecular size.

The set of external landmarks may in specific embodiments comprise at least two
marker compounds forming at least two spots, wherein the at least two marker
compounds are characterised by a pI of about 1-12, such as about 3-10. In still further
embodiments, the at least two marker compounds may be characterised by a Mw of
about 100-10^6, such as about 10^3-10^5. Specific embodiments of the invention may
include two marker compounds characterised by a pI of about 1-12 and a Mw of about
100-10^6.

The application of a set of marker or landmark substances or compounds in a gel may
include applying the set in the form of application strips, mixing and applying the set
together with the test samples, or applying the set at the time of casting of the gel.

When an electrophoresis separation has been made and the sample and marker
substances have been positioned into spots in the gel, information about the position
of each separated marker compound needs to be collected and stored. This information
may be collected using any of the determination processes selected from a group
comprising visual light, UV, IR, multispectral imaging, isotope labelling, colouring
techniques such as silver staining and Coomassie staining, and fluorescence
techniques. The external marker substances are preferably selected such that they are
distinguishable from the sample spots in the gel. This may be achieved by using
marker substances having optical properties different from the optical properties of
the sample spots, making it possible to detect the external markers in the gel. The
marker spots are preferably detected by scanning the gel and picking up signals
representing the positions of the spots using any of the techniques in the
aforementioned group. The scanning procedure results in a digitized image, which is
stored in a data memory.

The stained gels, containing both separated landmarks and sample molecules, may be
scanned in a scanner with dual detection possibility. By dual detection possibility
is meant that the scanner is able to detect signals from different staining
techniques, such as silver staining, fluorescent staining, radioactivity, or any other
staining method used. In one embodiment, the landmarks are detected in a separate
image, the marker image, after a scanning step enabling detection of the landmark
positions only. Without changing the position of the gel, the parameters of the
scanning apparatus are changed so as to enable a separate scanning and detection of
the separated sample molecules, e.g. the proteins, in order to register a sample image.
The two gel images, one with the separated landmarks and one with the separated sample
proteins, now collected in digitized form, are subsequently used for the image analysis
of the gel. The gel may also be scanned once, detecting the sample molecules and the
landmarks in one image. This may be convenient when the same detection methodology is
used for the sample molecules and the landmarks, but the optical and/or geometrical
properties of the marker spots and the sample spots allow differentiation between them.

The marker image is used for extraction of the co-ordinates of the external marker
spots in the image. This is done by standard techniques in mathematical imaging, and
any of a number of methods could be used. The result is a file of data representing
the co-ordinates of the detected external markers in the gel.

In accordance with the invention, the properties of the external markers according to
the two separation parameters are known. For the set of external markers added to
the sample prior to the electrophoresis, a file of those properties has been established
and stored on a computer readable storage medium, such as a hard drive, a CD-ROM or
other data storage device. The differences between the known properties and the
observed positions of the external markers on the gel are subsequently used to
construct a mathematical relation mapping the one set of data onto the other. The
relation should, for obvious reasons, be injective and continuous over the whole
defined image area. This mathematical relation could be defined and/or determined in
any of a number of ways. The result is a mathematical function mapping the existing,
detected image of the sample spots, or of the sample spots plus the marker spots, onto
another image, the "globally transformed image". Below, one example of an embodiment
for carrying out the invention is described with reference to the accompanying
drawings.

A silver stained gel formed in an electrophoresis separation process is placed in a
scanner and scanned in two different ways, or in a way that allows differentiating the
marker spots from the sample spots, in order to extract the co-ordinates for the spot
positions of the selected marker substances. The co-ordinates could typically be
determined using segmentation of the marker spots, e.g. by using a watershed
algorithm. A modified Gaussian distribution model of the marker spots may further be
used to locate the centre of gravity of each spot, the projection of which onto the
x-y plane is used as the co-ordinates of the spot. The co-ordinate data thus extracted
is stored in a file. The file of data for the set of detected spots of landmark
substances in the image, and the file of the properties of said landmark substances,
are loaded into a computer carrying a computer program devised in accordance with the
invention. The computer program now has access to the observed data and the "known"
data relating to the landmarks. The known data on file may be specifically stated in
the format of parameter pairs relating to the separation process used, but may also be
stored in a form from which this value pair may be calculated dependent on
electrophoresis conditions.
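
As an illustration of the co-ordinate extraction step, the sketch below computes an
intensity-weighted centre of gravity for one marker spot that has already been
segmented. It is a simplified stand-in only: the segmentation itself and the modified
Gaussian model mentioned above are not reproduced, larger pixel values are assumed to
mean more signal, and the names `spot_centroid`, `image` and `mask` are hypothetical.

```python
def spot_centroid(image, mask):
    """Return the intensity-weighted centre of gravity (x, y) of one spot.

    image: list of rows of grey levels, indexed image[y][x] (assumed layout).
    mask:  same shape, True for pixels belonging to the segmented spot.
    """
    total = 0.0
    sum_x = 0.0
    sum_y = 0.0
    for y, (row, mask_row) in enumerate(zip(image, mask)):
        for x, (value, inside) in enumerate(zip(row, mask_row)):
            if inside:
                total += value
                sum_x += value * x
                sum_y += value * y
    if total == 0:
        raise ValueError("spot contains no signal")
    return sum_x / total, sum_y / total
```

The resulting (x, y) pairs, one per marker spot, would play the role of the detected
landmark co-ordinates stored in the file.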

The co-ordinates for the ideal landmark positions are defined and accessible in the
computer program as vectors named $P_x$ and $P_y$. The co-ordinates for the detected
landmark positions are defined and accessible in the computer program as vectors $V_x$
and $V_y$. The x and y relate to the x-coordinates and the y-coordinates of the
respective positions.

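Using $P_x$ and $P_y$, all distances between the landmarks in the image plane of
the sought normalised image are calculated and stored in a vector $r$.
A function $U(r) = r^2 \log r^2$ is then calculated for all the $r$'s, and stored in a
matrix $K$. This matrix is thus an $n \times n$ matrix over the $n$ landmarks, in the
following manner:

$$
K = \begin{bmatrix}
0 & U(r_{12}) & \cdots & U(r_{1n}) \\
U(r_{21}) & 0 & \cdots & U(r_{2n}) \\
\vdots & \vdots & \ddots & \vdots \\
U(r_{n1}) & U(r_{n2}) & \cdots & 0
\end{bmatrix}
$$

where $r_{ij}$ is the distance between landmarks $i$ and $j$.

A matrix $P$ is constructed:

$$
P = \begin{bmatrix}
1 & x_1 & y_1 \\
1 & x_2 & y_2 \\
\vdots & \vdots & \vdots \\
1 & x_n & y_n
\end{bmatrix},
$$

that is, an $n \times 3$ matrix $[\,1 \;\; P_x \;\; P_y\,]$.

Then a matrix or operator $L$ is constructed according to:

$$
L = \begin{bmatrix} K & P \\ P^T & 0 \end{bmatrix}, \quad (n + 3) \times (n + 3),
$$

where $P^T$ is the transpose of the $P$ matrix and $0$ is a $3 \times 3$ matrix of zeros.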
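
The construction of $K$, $P$ and $L$ can be sketched as follows. This is a minimal
illustration in plain Python, not the program of the invention: the function names
`U` and `build_L` are hypothetical, matrices are held as lists of rows, and $U(0)$ is
taken as 0 (the limit of $r^2 \log r^2$ as $r$ tends to 0).

```python
import math


def U(r):
    """Radial function U(r) = r^2 log r^2, with U(0) defined as 0."""
    return 0.0 if r == 0 else r * r * math.log(r * r)


def build_L(px, py):
    """Build K (n x n), P (n x 3) and L ((n+3) x (n+3)) from ideal landmarks."""
    n = len(px)
    # K[i][j] = U(distance between landmark i and landmark j)
    K = [[U(math.hypot(px[i] - px[j], py[i] - py[j])) for j in range(n)]
         for i in range(n)]
    # Each row of P is (1, x_i, y_i)
    P = [[1.0, px[i], py[i]] for i in range(n)]
    # Upper block rows: [K | P]
    L = [K[i] + P[i] for i in range(n)]
    # Lower block rows: [P^T | 0]
    for c in range(3):
        L.append([P[i][c] for i in range(n)] + [0.0, 0.0, 0.0])
    return K, P, L
```

For three landmarks at (0, 0), (1, 0) and (0, 1), `build_L` returns a symmetric
6 x 6 matrix `L` whose lower-right 3 x 3 block is zero.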

Let $V = (v_1, v_2, \ldots, v_n)$ be any n-vector, and write $Y = (V \mid 0\ 0\ 0)^T$,
a column vector of length $n + 3$. A vector $W = (w_1, w_2, \ldots, w_n)$ and
coefficients $a_1, a_x, a_y$ are then defined by the equation
$L^{-1} Y = (W \mid a_1\ a_x\ a_y)^T$. According to the invention the elements of
$L^{-1} Y$ are subsequently used to define a function $f(x,y)$ everywhere in the plane:

$$
f(x, y) = a_1 + a_x x + a_y y + \sum_{i=1}^{n} w_i \, U(|P_i - (x, y)|).
$$

The function f is divided into two parts: a sum of functions U(r) and an affine part.



Let

$$
V = \begin{bmatrix}
x'_1 & x'_2 & \cdots & x'_n \\
y'_1 & y'_2 & \cdots & y'_n
\end{bmatrix},
$$

where each $(x'_i, y'_i)$ is the landmark homologous to $(x_i, y_i)$ in another copy
of $R^2$. Here, R represents the real numbers, and consequently $R^2$ represents the
real number pairs in the plane.



The application of $L^{-1}$ to the first column of $V^T$ (extended with three zeros,
as for $Y$ above) specifies the coefficients of 1, x, y, and the U's for $f_x(x,y)$,
the x-coordinate of the image of (x, y); the application of $L^{-1}$ to the second
column of $V^T$ does the same for the y-coordinate $f_y(x,y)$. The resulting function
$f(x,y) = [f_x(x,y), f_y(x,y)]$ is now vector-valued: it maps each point $(x_i, y_i)$
to its homolog $(x'_i, y'_i)$. Furthermore, this function is the least bent of all
such functions, according to the measure $I_f$, the integral quadratic variation over
all $R^2$, computed separately for real and imaginary parts of f and summed. These
vector-valued functions f(x,y) are the thin-plate spline mappings used in this method.
If the pairings of points between the sets are in accordance with biological homology,
the function f models the comparison of biological forms as a deformation. The whole
procedure is obviously invariant under translation or rotation of either set of
landmarks.
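
The fitting step can be sketched as below, assuming plain Python lists and a small
Gaussian elimination in place of an explicit inverse $L^{-1}$ (solving $L c = Y$ gives
the same coefficients). The names `solve`, `fit_tps` and `eval_tps` are hypothetical
and are not taken from the program of the invention; calling `fit_tps` once with the
x' values and once with the y' values yields $f_x$ and $f_y$.

```python
import math


def U(r):
    """Radial function U(r) = r^2 log r^2, with U(0) defined as 0."""
    return 0.0 if r == 0 else r * r * math.log(r * r)


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(A)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[piv][col]) < 1e-12:
            raise ValueError("singular system (collinear or repeated landmarks?)")
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = M[r][n] - sum(M[r][c] * x[c] for c in range(r + 1, n))
        x[r] = s / M[r][r]
    return x


def fit_tps(px, py, values):
    """Return (w_1..w_n, a_1, a_x, a_y) such that f(px[i], py[i]) = values[i]."""
    n = len(px)
    L = [[U(math.hypot(px[i] - px[j], py[i] - py[j])) for j in range(n)]
         + [1.0, px[i], py[i]] for i in range(n)]
    L.append([1.0] * n + [0.0] * 3)
    L.append(list(px) + [0.0] * 3)
    L.append(list(py) + [0.0] * 3)
    return solve(L, list(values) + [0.0] * 3)


def eval_tps(coef, px, py, x, y):
    """Evaluate f(x, y) = a_1 + a_x x + a_y y + sum_i w_i U(|P_i - (x, y)|)."""
    n = len(px)
    a1, ax, ay = coef[n:]
    bend = sum(coef[i] * U(math.hypot(px[i] - x, py[i] - y)) for i in range(n))
    return a1 + ax * x + ay * y + bend
```

When the target values are an exact affine function of the landmark co-ordinates, the
weights $w_i$ come out as zero (up to rounding) and only the affine part remains.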

The application of $L^{-1}$ to both the columns of $V^T$ thus specifies the
coefficients for the transform between the two sets of landmarks. The coefficients are
stored in the matrix "koeff".





The coefficients are now applied to all the pixels in the sought normalized image,
pixel-by-pixel, and the corresponding pixel position in the distorted image is
determined.



For this calculated pixel in the distorted image, the grey-level (0-255) is determined.
To determine the grey-level for a non-integer pixel position, some sort of
interpolation can be used. In our example a discrete bi-linear interpolation is used.
The grey-level thus determined is assigned to the pixel in the sought normalized image.
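
A bi-linear interpolation of this kind can be sketched as follows. The function name
`bilinear` and the `image[y][x]` list-of-rows layout are assumptions made for the
illustration, as is the choice to return 0 (black) for positions outside the image.

```python
import math


def bilinear(image, x, y):
    """Grey level at a non-integer position (x, y) of image[y][x]."""
    height, width = len(image), len(image[0])
    if not (0 <= x <= width - 1 and 0 <= y <= height - 1):
        return 0  # outside the defined area: black
    x0, y0 = int(math.floor(x)), int(math.floor(y))
    # Clamp the upper neighbours so that positions on the last row or
    # column do not index past the edge of the image.
    x1, y1 = min(x0 + 1, width - 1), min(y0 + 1, height - 1)
    tx, ty = x - x0, y - y0
    top = (1 - tx) * image[y0][x0] + tx * image[y0][x1]
    bottom = (1 - tx) * image[y1][x0] + tx * image[y1][x1]
    return int(round((1 - ty) * top + ty * bottom))
```

The result is a weighted mean of the four surrounding pixels, with weights given by
the fractional parts of the position.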

The next pixel in the sought normalized image is handled in the same way. In cases
where the transform maps the pixel in the sought image to a pixel position outside
the defined area in the distorted image, the pixel in the sought image is set to 0
(black).

When all pixels are transformed and assigned a grey-level, the sought image is
created and can be plotted and handled as a new image, corrected according to the
information from the pair of landmark sets.
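
The pixel-by-pixel procedure can be sketched as a loop over the sought image. The
names `warp_image`, `transform`, `sample` and `nearest` are hypothetical: `transform`
stands for any function returning the position in the distorted image for a pixel of
the sought image, and `sample` for the chosen interpolation. The nearest-neighbour
sampler shown is only a placeholder to keep the sketch short; the example in the text
uses bi-linear interpolation instead.

```python
def warp_image(distorted, width, height, transform, sample):
    """Build the sought image by looking up each of its pixels in `distorted`."""
    src_h, src_w = len(distorted), len(distorted[0])
    out = []
    for y in range(height):
        row = []
        for x in range(width):
            xs, ys = transform(x, y)  # position in the distorted image
            if 0 <= xs <= src_w - 1 and 0 <= ys <= src_h - 1:
                row.append(sample(distorted, xs, ys))
            else:
                row.append(0)  # outside the defined area: black
        out.append(row)
    return out


def nearest(image, x, y):
    """Placeholder sampler: grey level of the nearest pixel."""
    return image[int(round(y))][int(round(x))]
```

Because the loop runs over the sought image rather than the distorted one, every
output pixel receives exactly one value and no gaps are left in the result.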

Each point in the sought normalized image plane is mapped by a function

$$
(x, y) \to (x', y') = (x + f_x(x, y),\; y + f_y(x, y)),
$$

where $f_x$ and $f_y$ are the functions based on
$\sum_k (-1)^k \, U(|(x, y) - D_k|)$ as viewed in 3 dimensions.

Although several examples of embodiments have been described, a person skilled in
the art will realise that further modifications are possible within the scope of the
appended claims.